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ABSTRACT 

Two recent papers have claimed detection of non-Gaussian features in the COBE DMR sky maps 
of the cosmic microwave background. We confirm these results, but argue that Gaussianity is still not 
convincingly ruled out. Since a score of non-Gaussianity tests have now been published, one might 
expect some mildly significant results even by chance. Moreover, in the case of one measure which yields 
a detection, a bispectrum statistic, we find that if the non-Gaussian feature is real, it may well be due 
to detector noise rather than a non-Gaussian sky signal, since a signal-to-noise analysis localizes it to 
angular scales smaller than the beam. We study its spatial origin in case it is nonetheless due to a sky 
signal {e.g., a cosmic string wake or flat-spectrum foreground contaminant). It appears highly localized 
in the direction h = 39.5°, I = 257°, since removing a mere 5 pixels inside a single COBE beam area 
centered there makes the effect statistically insignificant. We also test Guassianity with an eigenmode 
analysis which allows a sky map to be treated as a random number generator. A battery of tests of this 
generator all yield results consistent with Gaussianity. 

Subject headings: cosmology: cosmic microwave background 



1. INTRODUCTION 

The detection of fluctuations in the cosmic microwave 
background (CMB) by the Differential Microwave Ra- 
diometer (DMR) on board the Cosmic Microwave Explorer 
(COBE) satellite (Smoot et al. 1992) began a new era 
for studies of the Universe on large scales. Current and 
planned experiments offer the promise of tight constraints 
on cosmological parameters, including both quantities im- 
portant for observational astronomy and inflationary pa- 
rameters which may provide the ultimate testing ground 
for fundamental particle physics, relating phenomena on 
the largest and smallest observable scales. 

A key cosmological constraint from observations of the 
primordial density field is its general statistical nature. In 
most inflationary scenarios, the density is a Gaussian ran- 
dom field, (although see, e.g., Peebles 1999). This implies 
that the joint probability distribution of the N ~ 4000 
temperatures in the galaxy-cut DMR sky map is a mul- 
tivariate Gaussian. In contrast, topological defect models 
predict a non-Gaussian density field {e.g., Avelino et al. 
1998). However, an analysis by Pen et al. (1997) suggests 
that defects are inconsistent with the observed power spec- 
trum of fluctuations. At present, inflationary models, or at 
least generically Gaussian models, dominate the literature 
on large-scale structure. 

It is therefore quite intriguing that two groups, Ferreira, 
Magueijo & Gorski (1998, hereafter FMG) and Pando, 
Valls-Gabaud & Fang (1998, hereafter PVF), claim that 
the CMB fluctuations measured by COBE DMR are non- 
Gaussian. If substantiated, these results could potentially 
rule out standard inflation as the primary mechanism for 
cosmic structure formation in the early Universe. The re- 
sults are all the more surprising because the literature 
documents considerable previous effort to identify non- 
Gaussianity in the CMB (see references given in ^^), all of 
which failed. 



In this Letter, we revisit the question of whether COBE 
DMR rules out Gaussianity. We begin with a series of new 
tests of Gaussianity based on eigenmode analyses of the 
DMR map (§||). We then consider the results of FMG and 
PVF in Section |[ exploring their sensitivity to individual 
modes and spatial features as well as the significance of 
these results in light of the other statistical tests which 
have been applied to the data. We present our conclusions 
in Section 

2. STATISTICS OF EIGENMODES 

We analyze a sky map formed of the combined 53 and 
90 GHz four year DMR data pixelized at resolution 6 in 
Galactic coordinates, with the "custom" Galaxy sky mask 
(Bennett et al. 1996), and with monopole and dipole con- 
tributions removed as in Tegmark & Bunn (1995). The 
resulting data set consists of temperatures for 3,881 pixels 
in the sky, which we arrange in a vector x. We take the 
covariance matrix C = (xx*) to be of the form C = S-l-N, 
where the noise matrix N is diagonal and the signal matrix 
S is a Harrison-Zel'dovich power spectrum normalized to 
PS = 18 fik {e.g., Bennett et al. 1996). 

The pixelized data are both correlated and noisy, hence 
we subject the map x to both a principal component 
analysis (PGA) and a signal-to-noise eigenmode analysis 
(SNA), two standard astrophysical tools. Both of these 
procedures involve expanding the data in a new basis, 
the eigenvectors of C for the PCA case and the eigen- 
vectors of N~^/^SN~^/^ for the SNA case. The eigenvec- 
tors are sorted by decreasing eigenvalue and normalized 
so that the expansion coefficients have unit variance. For 
the SNA (Bond 1994; Bunn & Sugiyama 1995; Tegmark 
et al. 1997), the modes are listed in order of decreasing 
signal-to-noise level. For the PCA, the modes explain suc- 
cessively less and less of the variance in the data. Since 
the DMR noise per pixel does not fluctuate much, the 
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two methods give similar results. The first few hundred 
modes contain essentially all the cosmological information, 
and probe successively smaller angular scales (Bond 1994; 
Bunn & Sugiyama 1995; Bunn & White 1996). We use the 
top 250 modes for the Gaussianity tests described below. 

The purpose of this exercise is two-fold: First, we can 
determine how many cosmologically significant degrees of 
freedom a given statistical test should consider. Second, 
the decomposition into uncorrelated eigenmodes allows the 
data to be cast as a list of random numbers which, under 
the null hypothesis that the DMR data are Gaussian|^, will 
be independent and normally distributed. Although the 
statistics of these samples may not be completely testable 
in practice, we can still constrain general properties of the 
COBE data. 

We run both the hsts of 250 PGA and SNA entries and 
the entire list of 3877 numbers (the rank of the covariance 
matrix after monopolc and dipole subtraction) from the 
PGA basis through a smattering of tests, first for Gaus- 
sianity of the individual list elements. The null hypothesis 
cleanly passes Kolmogorov-Smirnoff and ^-square tests, 
along with tests of cumulants up to fourth order and of 
the significance of the top few outliers. None of these 
tests manage to reject the Gaussian null hypothesis with 
95% confidence. Note that these tests are sensitive only to 
the 1-point distribution of mode amplitudes, not to cor- 
relations between modes. This is strong though not ir- 
refutable evidence that if the DMR data are non-Gaussian 
then mode correlations, not mode amplitudes, are respon- 
sible. 

The next step in testing the Gaussian hypothesis is to 
look for mode correlations. This is a difficult thing to do in 
any exhaustive way, even for our short lists of 250 elements, 
and a thorough treatment of this problem is beyond the 
scope of this Letter. We note only that no correlations 
were detected above the 95% confidence level in tests of 
second, third and fourth order iV-point correlations. We 
also used mode amplitudes to simulate rolls of a die and 
examine the one- and two-point distributions of outcomes 
to see if the die is loaded. If one wishes to play dice with 
the Universe, evidently it would be a fair game. 

3. REPORTS OF NON-GAUSSIANITY 

The above tests of the GOBE data (26 in all) join a siz- 
able list of previous results without significant detections 
of non-Gaussianity. For the 53 GHz DMR 1 year data, the 
three-point function was studied by Luo (1994, one test) 
and Hinshaw et al. (1994, two tests) while Smoot et al. 
(1994) considered the topological genus and kurtosis (two 
tests). For the 53 and 90 GHz DMR 2 year data, Hinshaw 
et al. (1995) studied the equilateral and pseudocollapsed 
three-point function at three £-cuts with 12 tests in total; 
the most extreme gave a 98% non-Gaussianity detection, 
but was deemed to suffer from a known noise problem. 
Kogut et al. (1996) tested the DMR 4 year data for three- 
point correlations, genus and peak correlations (4 tests in 
all), while Heavens (1998) analyzed this same data set us- 
ing an optimized bispectrum statistic on 5 different scales. 
Gaztafiaga et al. (1998) performed 5 variance-of-variance 

'^Note that as long as the true CMB sky is Gaussian, our data 
set will be Gaussian as well: Both the smoothing done by the DMR 
beam, our galaxy cut and our monopole and dipole removal are linear 
operations, and all linear operations preserve Gaussianity. 



tests with the strongest rejection of Gaussianity being at 
the 91% level. Most recently, Diego et al. (1999) concluded 
that the DMR data were consistent with Gaussianity in a 
partition function analysis. 

On the other hand, there are two Gaussianity tests 
which the data reportedly fail. One is based on the bis- 
pectrum statistic of FMG (at 98% confidence) and other 
on the fourth-order wavelet statistic proposed by PVF 
(at 99% confidence). These are strong signals of non- 
Gaussianity, but some caution is in order. Given that 
the DMR data have been subjected to the 32 other pub- 
lished tests cited above which provide no evidence of non- 
Gaussianity (not to mention the 26 reported here and any 
tests, including some of our own, which were not reported 
because they yielded null results), are the FMG and PVF 
results simply expected outliers in the distribution of test 
results? 

To address this point, suppose we try to rule out some 
null hypothesis by subjecting a data set to n independent 
statistical tests, and that the most successful one rules it 
out at a confidence level p, say 99%. How significant is 
this really? Let pi denote the confidence level obtained 
from the i*'* test, with Pmax = inax{pi} corresponding to 
the most successful test. The probability of getting a less 
extreme result is then 

n n 

P{Pm.. < P) = n P{P^ <P)=1[P= P" ■ (1) 

i=l 1=1 

For example, the most extreme S/N eigenmode coefficient 
in Section || is a 3.3-ct outlier. If that one coefficient was 
all we had, then we would reject Gaussianity at the 99.9% 
level. However, we have 250 independent numbers and 
equation (|^) shows that our level of confidence in rejection 
from that one extreme coefficient is only 0.999^^" w 78%. 
Similarly, the list of 34 published Gaussianity tests men- 
tioned above contains one which rules out the null hypoth- 
esis at the 99% level. If these tests were independent, we 
could reject Gaussianity with only 0.99^'* w 71% confi- 
dence. Of course the tests are not strictly independent. 
Yet if some of them capture only subsets of the informa- 
tion contained in the ~ 4000 GOBE data points, then they 
may be effectively independent of each other. With this in 
mind we examine the FMG and PVF tests in more detail. 

3.1. The wavelet test 

The detection reported by PVF of a non-Gaussian sig- 
nal is made with a measure of fourth-order correlations 
between wavelet coefficients of the DMR data. The co- 
efficients are obtained using a discrete transform of the 
northern sixth of the sky map, after the spherical plane 
of the sky has been projected onto the face of a cube. 
Taken alone, this measure reportedly gives a detection 
of non-Gaussian signal at 99% confidence. But we have 
more information, even about PVF's wavelets: a second 
projection of the sky map onto the opposite face of the 
cube (i.e., the opposite hemisphere of the sky) lies roughly 
at the 40% confidence level. Together, a joint two-faced 
wavelet analysis gives a weaker rejection of the Gaussian 
hypothesis, formally at 97% confidence. Furthermore, the 
PVF detection is claimed only for wavelets on one spe- 
cific scale even though they seek similar detections on two 
other scales but do not find them. Likewise, a detection 




Fig. 1. — The galaxy-cut COBE DMR map is shown in AitofT projection. The inset shows the five pixels whose omission makes the FMG 
effect go away, with the ellipse indicating the FWHM size and the projected shape of the DMR beam in that region. 



was sought but not found for third-order moments. With 
n = 2x3x2, equation ([|) predicts a much lower confidence 
level, 0.99^^ ~ 89%, for the claimed detection. 

In an analysis based on compact but smooth wavelets 
(e.g., Bromley 1994), we confirm the existence of a strong 
(99.6%) non-Gaussian outlier at the ll°-|-22° scales re- 
ported by PVF. Interestingly, we can make the entire non- 
Gaussian signal vanish by simply zeroing or flipping the 
sign of a single, modestly rare principal component ampli- 
tude (a 2.7-a fluctuation of the 90^^ mode). Also the 17*-^ 
eigenmodes from both the PGA and SNA strongly affect 
the wavelet detection. In the S/N basis, this mode has 
an amplitude of 2.1-a (slightly less in the principal com- 
ponent vector); by zeroing the amplitude of these modes, 
the wavelet statistic yields less than a 2-cr detection. Fur- 
thermore, when set to zero, a single spot in the sky the size 
of the GOBE beam (centered at 5 = 64.6°, I = 40.6°), also 
cuts the non-Gaussian signal down to a similar level. Of 
the six pixels in this spot, three are within one standard 
deviation of the expected noise, while two are at the 2.6- 
tT„ level and the third is a strong outlier at 3.1-cr„. (How 
this spot affects the PVF results depends on unspecified 
details of their analysis.) 

3.2. The hispectrum test 

FMG introduce a measure, if, based on averaged triplets 
of projection coefficients from even-multipole spherical 
harmonics. Non-Gaussian behavior is seen only at £ = 16, 
but FMG are careful to consider the fact that the bis- 
pectrum at eight other ^-values are individually consistent 
with the Gaussian hypothesis. The reported confidence of 
the non-Gaussian detection is 98%. 

There is nonetheless a possibility that the nine if val- 
ues given by FMG are sensitive to only a fraction of the 



information in the COBE data. Although there is some 
spherical harmonic mode coupling as a result of the sky 
mask, the odd ^ multipoles are largely missing as well as 
multipoles above £ = 18. There may also be dependence 
on localized noise. We emphasize this latter point by set- 
ting to zero the five pixels in the beam-size spot on the 
sky centered at Galactic latitude b — 39.5° and longitude 
t = 257°, shown in Fig. 1. The value of if^ falls from 
about 0.92 to 0.78, approximately a 98% detection on its 
own0, but well below 2-cr when taken in conjunction with 
the other eight if values shown in Fig. 2 (.98^ w 84% if 
the 9 values were uncorrelated) . Note that there is a sin- 
gle, rare pixel brightness value in the spot. In units of the 
expected noise fluctuations, it is at the level of — 3.5fTn, 
and zeroing it alone cuts /fg by 10%. 

The bispectrum statistic is also highly sensitive to in- 
dividual eigenmodes. Zeroing or flipping the sign of prin- 
cipal component 151 causes ifg to drop from 0.92 to the 
unambiguously Gaussian values of 0.61 and 0.20, respec- 
tively. Furthermore, there is sensitivity to S/N eigenmode 
224, a 3-cr fluctuation (the second most extreme of the 
first 250 modes) which is more strongly coupled to noise 
than cosmic structure (its S/N eigenvalue is 0.4). Zeroing 
this knocks /fg to 0.74; flipping the sign causes the value 
to fall to 0.52, a decidedly Gaussian value. Since both 
of these eigenmodes are dominated by noise rather than 
cosmic signal, it is possible that the source of the alleged 
non-Gaussianity is detector noise rather than GMB. 

^Note that the effect of zeroing the spot depends somewhat on 
details of removing monopole and dipole contributions to the DMR 
maps. If explicit removal (e.g., Tegmark & Bunn 1995) is not per- 
formed on the custom cut map, as is apparently the case in the 
bispectrum analysis of Magueijo, Ferreira &; Gorski (1999; Fig. 1 
therein), then the effect of removing the spot (actually a nearest- 
neighbor) is to lower to 0.66. 



4 



CO ■ 



1 

0.8 
0.6 
0.4 
0.2 







1 1 


1 1 , 


1 ' ' ' 


• 


FMG 






• 


o 


w/ o 


beam 


spot 




A 


w/ o 


mode 


152 












// M 


L 








A A - 
¥ \ - 




















\ - 










\a — 

V - 






1 1 








10 



15 



20 



Fig. 2. — The bispectrum statistic of FMG for various multipolcs 
I. The dark circles are the full galaxy-cut COBE data (cf. Fig. 1 
in FMG), and the open circles are the data after zeroing 5 pixels 
inside a single COBE beam centered at 6 = 39.5°, / = 257°). The 
triangles result when the amplitude of mode 151 (containing 0.15% 
of the total power) is set to zero. 



Note that in the above analyses, we systematically 
searched for the modes or pixels which affected the wavelet 
and bispectrum statistics the most. In each case we found 
only a few which cause more than an insignificant (sev- 
eral percent) change in these measures, and interestingly 
the significant modes or pixels were different for the two 
measures. Using randomly generated Gaussian skymaps 
selected for apparent non-Gaussianity similar to the DMR 
data, we also checked that it is quite common for a Gaus- 
sian map to give a wavelet or bispectrum measure that is 
sensitive to only a few individual pixels or modes, just as 
in the DMR case. 



modes and COBE-beam spots that we isolated for the 
wavelet and bispectrum statistics are candidates for po- 
tential non-Gaussian sources in the CMB. This latter pos- 
sibility would perhaps be more satisfying if both measures 
were coupling to the same non-Gaussian structure in the 
sky. However, this is not obviously the case, since both the 
bispectrum statistic and the wavelet measure show virtu- 
ally no sensitivity to the sky spots and eigenmodes which 
so dramatically affect the other. 

It is generally much easier to show that a bad random 
number generator is bad then to prove that a good one is 
good. Indeed, the results reported here fail to demonstrate 
that the CMB really is Gaussian. Conversely, the search 
for non-Gaussianity is also something of an uphill battle, a 
fight against the central limit theorem which causes both 
instrumental effects and the linear combinations involved 
in the eigenmode expansions to make things look more 
Gaussian. Therefore statistical measures should be tuned 
for the specific type of non-Gaussianity that physical mod- 
els predict. This approach is taken in many recent studies 
{e.g., Cayon & Smoot 1995; Ma gueijo 1995; Torres et al. 
1995; Gangui 1996; Gangui & MoUerach 1996; Ferreira & 
Magueijo 1997; Ferreira et al. 1997; Barrieiro et al. 1998, 
Lewin et al. 1999; Popa 1998) with an eye toward upcom- 
ing, high-resolution CMB data. 
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NSF Grant PHY 95-07695 and the use of supercomputing 
resources provided by NASA/JPL and Caltech/CACR. 
MT was funded by NASA though grant NAG5-6034 and 
Hubble Fellowship HF-01084.01-96Afrom STScI, operated 
by AURA, Inc. under NASA contract NAS5-26555. 



4. CONCLUSION 

The problem of the statistical nature of the CMB may be 
cast in a conceptually simple form: just use the observed 
temperature fluctuations to make a random number gen- 
erator, based on assumed statistical properties, and test 
its quality. Here we have used the 53-1-90 GHz COBE sky 
map to generate lists of putative random numbers with 
principal components and signal-to-noise eigenmodes. In 
both cases the one-point distribution is manifestly Gaus- 
sian. If the CMB on COBE scales is non-Gaussian, it is 
the result of correlations between modes. Unfortunately, 
exhaustive tests of mode correlations are not feasible. A 
few tests which can pick up a fair range of non-Gaussian 
behavior were performed and no evidence of mode corre- 
lations was found. 

Here we have also considered the two statistics which re- 
portedly detect non-Gaussianity in the COBE data. Both 
detections turn out to be fragile in the sense that they van- 
ish when a single DMR beam spot or a single eigenmode is 
removed. Moreover, we found that the detection by PVF, 
based on wavelets alone, was less significant than originally 
claimed. Even so, with the dozens of different Gaussianity 
tests that have now been published, it would not be sur- 
prising if a perfectly valid analysis rejected Gaussianity at 
say 98% confidence purely by accident. 

Our results cast some doubt on the significance of the 
claimed non-Gaussian behavior in the CMB. If the re- 
ported detections are real nonetheless, then the eigen- 
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